IN THE CLAIMS 



1. (Original) A computer-implemented process for using feature selection 
to obtain a strong classifier from a combination of weak classifiers, comprising using a 
computer to perform the following process actions: 

(a) inputting a set of training examples, a prescribed maximum number of 
weak classifiers, a cost function capable of measuring the overall cost, and an 
acceptable maximum cost; 

(b) computing a set of weak classifiers, each classifier being associated 
to a particular feature of the training examples; 

(c) determining which of the set of weak classifiers is the most significant 
classifier; 

(d) adding said most significant classifier to a current set of optimal weak 
classifiers; 

(e) determining which of the current set of optimal weak classifiers is the 
least significant classifier; 

(f) computing the overall cost for the current set of optimal weak 
classifiers using the cost function; 

(g) conditionally removing the least significant classifier for the current set 
of optimal weak classifiers; 

(h) computing the overall cost for the current set of optimal weak 
classifiers less the least significant classifier using the cost function; 

(i) determining whether the removal of the least significant classifier 
results in a lower overall cost; 

(j) whenever it is determined that the removal of the least significant 
classifier results in a lower overall cost, eliminating the least significant classifier; 

(k) recomputing each classifier in the current set of optimal weak classifiers 
associated with a feature added subsequent to the eliminated classifier while keeping 
the earlier optimal weak classifiers unchanged; 

(l) repeating actions (f) through (k) until it is determined the removal of the 
least significant classifier does not result in a lower overall cost and then reinstating the 
last identified least significant classifier to the current set of optimal weak classifiers; 

(m) determining if the number of weak classifiers in the current set of optimal 
weak classifiers equals the prescribed maximum number of weak classifiers or the last 
computed overall cost for the current set of optimal weak classifiers is less than the 
acceptable maximum cost; and 

(n) whenever it is determined that the number of weak classifiers in the 
current set of optimal weak classifiers does not equal the prescribed maximum number 
of weak classifiers and the last computed overall cost for the current set of optimal 
weak classifiers exceeds the acceptable maximum cost, repeating actions (c) through 
(m) until it is determined that the number of weak classifiers in the current set of optimal 
weak classifiers does equal the prescribed maximum number of weak classifiers or the 
last computed overall cost for the current set of optimal weak classifiers becomes less 
than the maximum allowable cost, then outputting the sum of the individual weak 
classifiers as the trained strong classifier. 
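
For illustration only, the control flow of actions (c) through (n) can be sketched in Python as below. This is a simplified sketch under stated assumptions, not the claimed process itself: each candidate weak classifier is represented by a fixed, precomputed vector of its outputs on the training examples (the hypothetical `candidates` dictionary), the cost is assumed to be an exponential cost, the recomputation of action (k) is omitted, and an extra stopping rule (stop when no candidate lowers the cost) is added so that the loop is guaranteed to terminate. The names `exp_cost` and `floating_select` are invented for this sketch.

```python
import numpy as np

def exp_cost(scores, y):
    """Assumed cost function: sum_i exp(-y_i * H(x_i))."""
    return float(np.sum(np.exp(-y * scores)))

def floating_select(candidates, y, max_classifiers, max_cost):
    """candidates: dict mapping a feature id to an array of weak-classifier
    outputs on the training examples (fixed here; not re-learned)."""
    y = np.asarray(y, dtype=float)

    def total(ids):
        # Summed output of the weak classifiers whose ids are given.
        out = np.zeros(len(y))
        for j in ids:
            out = out + candidates[j]
        return out

    selected = []
    cost = exp_cost(total(selected), y)
    # Actions (m)/(n): continue until the size limit or the cost target is met.
    while len(selected) < max_classifiers and cost >= max_cost:
        remaining = [j for j in candidates if j not in selected]
        if not remaining:
            break
        # Actions (c)/(d): add the candidate that gives the lowest cost.
        base = total(selected)
        best = min(remaining, key=lambda j: exp_cost(base + candidates[j], y))
        new_cost = exp_cost(base + candidates[best], y)
        if new_cost >= cost:
            break  # added safeguard (not in the claim): no candidate helps
        selected.append(best)
        cost = new_cost
        # Actions (e)-(l): drop the least significant classifier while doing
        # so lowers the cost. The trial removal is evaluated without mutating
        # `selected`, so "reinstating" the last candidate is implicit.
        while len(selected) > 1:
            full = total(selected)
            worst = min(selected, key=lambda j: exp_cost(full - candidates[j], y))
            reduced = exp_cost(full - candidates[worst], y)
            if reduced >= cost:
                break
            selected.remove(worst)
            cost = reduced
    return selected, cost

# Toy usage with two hypothetical candidates on two training examples.
toy = {"a": np.array([1.0, -1.0]), "b": np.array([2.0, -2.0])}
chosen, final_cost = floating_select(toy, np.array([1, -1]), 3, 0.01)
```

In this toy run both candidates are kept, because removing either one would raise the cost.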

2. (Currently Amended) The process of Claim 1 wherein the process 
action of computing each classifier of a set of weak classifiers comprises the process 
action of deriving each classifier based on a histogram of a scalar value feature for face 
training examples and a histogram of a scalar value feature for [[the]] non-face training 
examples. 

3. (Original) The process of Claim 1 wherein the most significant 
classifier includes the feature that is the most likely to predict whether a training 
example matches the classification of a particular classifier. 

4. (Original) The process of Claim 1 wherein the set of weak classifiers 
is designed to classify whether a training example is a face or non-face. 

5. (Original) The process of Claim 1 wherein the set of weak classifiers is 
designed to classify a training example as a text type. 



6. (Original) The process of Claim 1 wherein the set of weak classifiers is 
designed to classify a training example as a type of document. 

7. (Original) The process of Claim 1 wherein the set of weak classifiers is 
designed to classify a training example as a speech pattern. 

8. (Original) The process of Claim 1 wherein the set of weak classifiers is 
designed to classify a training example as a type of medical condition. 



9. (Original) The process of Claim 1 wherein a weak classifier $h_j(x)$ is 
computed as 

$$h_j(x) = \log \frac{P_j(x \mid y = +1, w)}{P_j(x \mid y = -1, w)} + \log \frac{P(y = +1)}{P(y = -1)}$$

wherein the probability densities of a feature j for a sub-sample x of a training 
example are denoted by $P_j(x \mid y = +1)$ for a sought pattern and $P_j(x \mid y = -1)$ for a 
non-sought pattern and the normalized weights are denoted by w. 

10. (Original) The process of Claim 9 wherein the probability density for a 
sought pattern and the probability density for a non-sought pattern can be estimated 
using the histograms resulting from weighted voting of the training examples. 
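
As an illustration of Claims 9 and 10, the following Python sketch estimates the two class-conditional densities from weighted histograms of a scalar feature and combines them into a log-likelihood-ratio weak classifier. It is a minimal sketch under stated assumptions: the bin count `bins`, the smoothing constant `eps`, the use of class counts to estimate the prior ratio P(y = +1)/P(y = -1), and the function name `fit_llr_weak_classifier` are all choices made for this example and do not come from the claims.

```python
import numpy as np

def fit_llr_weak_classifier(feature, y, w, bins=8, eps=1e-6):
    """Build h(x) = log(P(x|y=+1,w) / P(x|y=-1,w)) + log(P(y=+1) / P(y=-1))
    from weighted histograms of a scalar feature (assumed binning scheme)."""
    feature = np.asarray(feature, dtype=float)
    y = np.asarray(y)
    w = np.asarray(w, dtype=float)
    w = w / w.sum()  # normalized weights
    edges = np.linspace(feature.min(), feature.max(), bins + 1)
    # Weighted voting: each training example adds its weight to its bin.
    pos_hist, _ = np.histogram(feature[y == 1], bins=edges, weights=w[y == 1])
    neg_hist, _ = np.histogram(feature[y == -1], bins=edges, weights=w[y == -1])
    # Smooth and normalize so that empty bins do not produce log(0).
    p_pos = (pos_hist + eps) / (pos_hist.sum() + eps * bins)
    p_neg = (neg_hist + eps) / (neg_hist.sum() + eps * bins)
    # Assumption: the prior ratio is estimated from the class counts.
    prior = np.log((np.sum(y == 1) + eps) / (np.sum(y == -1) + eps))
    table = np.log(p_pos / p_neg) + prior

    def h(x):
        # Look up the bin of each query value, clamping to the histogram range.
        idx = np.searchsorted(edges, np.asarray(x, dtype=float), side="right") - 1
        return table[np.clip(idx, 0, bins - 1)]

    return h

# Toy usage: low feature values are non-sought, high values are sought.
h = fit_llr_weak_classifier([0.0, 0.1, 0.2, 0.8, 0.9, 1.0],
                            [-1, -1, -1, 1, 1, 1], np.ones(6))
```

A positive output of `h` favors the sought pattern and a negative output favors the non-sought pattern.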

11. (Original) The process of Claim 9 wherein the process action of 
determining which of the set of weak classifiers is the most significant classifier 
comprises defining the most significant classifier $h_M(x)$ as 

$$h_M(x) = \arg\min \sum_i e^{-y_i h(x_i)},$$

wherein $H^{w} = \{h_j^{w}(x) \mid \forall j\}$, $h(x) =$ [illegible in source], and $M$ is 
the total number of weak classifiers in the set of weak classifiers. 

12. (Original) The process of Claim 9 wherein the process action of 
determining which of the set of weak classifiers is the least significant classifier 
comprises defining the least significant classifier $h'(x)$ as 

$$h' = \arg\min_{h \in \mathcal{H}_M} J(H_M - h)$$

where $H_M$ denotes the strong classifier built upon the current set $\mathcal{H}_M$ of selected weak 
classifiers. 

13. (Original) The process of Claim 1 wherein the process action of 
computing the overall cost comprises computing the overall cost $J(h(x))$ as 

$$J(h(x)) = \sum_i e^{-y_i h(x_i)}$$

wherein $y = +1$ for a sought pattern and $y = -1$ for a non-sought pattern 
and $h(x_i)$ is a weak classifier in the set of weak classifiers. 

14. (Original) The process of Claim 1 wherein outputting the sum of the 
individual weak classifiers as the trained strong classifier comprises outputting the sum 
$H(x)$ as 

$$H(x) = \operatorname{sign}\left[\sum_{m=1}^{M} h_m(x)\right]$$

wherein $M$ is the total number of weak classifiers in the set of weak classifiers and 
$h_m(x)$ is a weak classifier in the current set of weak classifiers. 
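
The cost of Claim 13 and the strong-classifier output of Claim 14 can be illustrated together with a short Python sketch. The assumptions here are that each weak classifier is a callable returning a real-valued score for a scalar feature, that a summed score of exactly zero is mapped to +1 (the claims do not say how ties are resolved), and that the cost is evaluated on the summed score. The toy classifiers `h1` and `h2` and all function names are hypothetical.

```python
import numpy as np

def strong_score(weak_classifiers, x):
    """Summed weak-classifier output: sum over m of h_m(x)."""
    x = np.asarray(x, dtype=float)
    score = np.zeros_like(x)
    for h_m in weak_classifiers:
        score = score + h_m(x)
    return score

def strong_classify(weak_classifiers, x):
    """H(x) = sign[sum_m h_m(x)]; a score of zero is mapped to +1 (assumption)."""
    return np.where(strong_score(weak_classifiers, x) >= 0, 1, -1)

def overall_cost(weak_classifiers, x, y):
    """Exponential cost: sum over i of exp(-y_i * sum_m h_m(x_i))."""
    y = np.asarray(y, dtype=float)
    return float(np.sum(np.exp(-y * strong_score(weak_classifiers, x))))

# Hypothetical toy weak classifiers on a scalar feature.
h1 = lambda x: np.where(x > 0.5, 1.0, -1.0)
h2 = lambda x: np.where(x > 0.3, 0.5, -0.5)

samples = [0.1, 0.4, 0.9]
labels = [-1, -1, 1]
predictions = strong_classify([h1, h2], samples)  # array([-1, -1,  1])
cost = overall_cost([h1, h2], samples, labels)
```

Because the cost is exponential in the negative margin, a confidently misclassified example contributes far more to the total than a correctly classified one.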

15. (Original) A system for detecting a person's face in an input image and 
identifying a face pose range into which the face pose exhibited by the detected face 
falls, the system comprising: 

a general purpose computing device; and 

a computer program comprising program modules executable by the 
computing device, wherein the computing device is directed by the program modules of 
the computer program to: 

create a database comprising a plurality of training feature 
characterizations, each of which characterizes the face of a person at a known face 
pose or a non-face; 

train a plurality of detectors arranged in a pyramidal architecture to 
determine whether a portion of an input image depicts a person's face having a face 
pose falling within a face pose range associated with one of the detectors using the 
training feature characterizations; and wherein 

said detectors using a greater number of feature characterizations are 
arranged at the bottom of the pyramid, and 

said detectors arranged to detect finer ranges of face pose are 
arranged at the bottom of the pyramid; and wherein the program module to train a 
plurality of detectors comprises sub-modules to, 

(a) input a set of training examples, a prescribed maximum number of 
weak classifiers, a cost function capable of measuring the overall cost, and an 
acceptable maximum cost; 

(b) compute a set of weak classifiers, each classifier being associated to 
a particular feature of the training examples; 

(c) determine which of the set of weak classifiers is the most significant 
classifier; 

(d) add said most significant classifier to a current set of optimal weak 
classifiers; 

(e) determine which of the current set of optimal weak classifiers is the 
least significant classifier; 

(f) compute the overall cost for the current set of optimal weak classifiers 
using the cost function; 

(g) conditionally remove the least significant classifier for the current set 
of optimal weak classifiers; 

(h) compute the overall cost for the current set of optimal weak classifiers 
less the least significant classifier using the cost function; 

(i) determine whether the removal of the least significant classifier results 
in a lower overall cost; 

(j) whenever it is determined that the removal of the least significant 
classifier results in a lower overall cost, eliminate the least significant classifier; 

(k) recompute each classifier in the current set of optimal weak classifiers 
associated with a feature added subsequent to the eliminated classifier while 
keeping the earlier optimal weak classifiers unchanged; 

(l) repeat actions (f) through (k) until it is determined the removal of the 
least significant classifier does not result in a lower overall cost and then reinstate the 
last identified least significant classifier to the current set of optimal weak classifiers; 

(m) determine if the number of weak classifiers in the current set of optimal 
weak classifiers equals the prescribed maximum number of weak classifiers or the last 
computed overall cost for the current set of optimal weak classifiers is less than the 
acceptable maximum cost; and 

(n) whenever it is determined that the number of weak classifiers in the 
current set of optimal weak classifiers does not equal the prescribed maximum number 
of weak classifiers and the last computed overall cost for the current set of optimal 
weak classifiers exceeds the acceptable maximum cost, repeat actions (c) through (m) 
until it is determined that the number of weak classifiers in the current set of optimal 
weak classifiers does equal the prescribed maximum number of weak classifiers or the 
last computed overall cost for the current set of optimal weak classifiers becomes less 
than the maximum allowable cost, then output the sum of the individual weak classifiers 
as the trained strong classifier. 

16. (Cancelled) 